Tight upper tail bounds for cliques 

B. DeMarco* and J. Kahn^ 
Abstract 

With $\xi_k = \xi_k^{n,p}$ the number of copies of $K_k$ in the usual (Erdős–Rényi) random graph $G(n,p)$, $p \ge n^{-2/(k-1)}$ and $\eta > 0$, we show when $k > 1$

\[ \Pr(\xi_k > (1+\eta)\mathsf{E}\xi_k) < \exp\left[-\Omega_{\eta,k} \min\{n^2p^{k-1}\log(1/p),\; n^kp^{\binom{k}{2}}\}\right]. \]

This is tight up to the value of the constant in the exponent.

1 Introduction 

Let $G(n,p)$ be the Erdős–Rényi random graph on $n$ vertices, in which every edge occurs independently with probability $p$, and let $H$ be a fixed graph with $v_H = |V(H)|$ and $e_H = |E(H)|$. A copy of $H$ in $G(n,p)$ is any subgraph of $G(n,p)$ isomorphic to $H$. It has been a long studied question (e.g. [U El EH 111 da [m IS]) to estimate, for $\eta > 0$ and $\xi_H = \xi_H^{n,p}$ the number of copies of $H$ in $G(n,p)$,

\[ \Pr(\xi_H > (1+\eta)\mathsf{E}\xi_H). \tag{1} \]
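For very small parameters the quantity in (1) can be explored by brute force. The following is only an illustrative sketch for cliques $H = K_k$: the function names are ours, not the paper's, and the simulation says nothing about the asymptotic regime the paper is concerned with.

```python
import itertools
import math
import random


def expected_clique_count(n: int, k: int, p: float) -> float:
    """E[xi_k] = C(n, k) * p^C(k, 2) for copies of K_k in G(n, p)."""
    return math.comb(n, k) * p ** math.comb(k, 2)


def sample_gnp(n: int, p: float, rng: random.Random) -> set:
    """Sample the edge set of G(n, p) as a set of frozensets {u, v}."""
    return {frozenset(e) for e in itertools.combinations(range(n), 2)
            if rng.random() < p}


def count_cliques(n: int, k: int, edges: set) -> int:
    """Brute-force count of k-cliques; only sensible for small n."""
    return sum(
        all(frozenset(pair) in edges for pair in itertools.combinations(c, 2))
        for c in itertools.combinations(range(n), k)
    )


def upper_tail_estimate(n, k, p, eta, trials, seed=0):
    """Monte Carlo estimate of Pr(xi_k > (1 + eta) * E xi_k)."""
    rng = random.Random(seed)
    threshold = (1 + eta) * expected_clique_count(n, k, p)
    hits = sum(count_cliques(n, k, sample_gnp(n, p, rng)) > threshold
               for _ in range(trials))
    return hits / trials
```

Since the probability in (1) is exponentially small in the range of interest, a direct simulation like this is useful only as a sanity check of the definitions.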

To avoid irrelevancies, let us declare at the outset that we always assume $p \ge n^{-1/m_H}$, where, as usual (e.g. [10, p.6]),

\[ m_H = \max\{e_K/v_K : K \subseteq H\} \tag{2} \]

(so $n^{-1/m_H}$ is a threshold for "$G \supseteq H$"; see [10, Theorem 3.4]); in particular, when $H = K_k$ we assume $p \ge n^{-2/(k-1)}$. For smaller $p$ the problem is not very interesting (e.g. for bounded $\eta$ the probability in (1) is easily seen to be $\Theta(\min\{n^{v_K}p^{e_K} : K \subseteq H, e_K > 0\})$; see [10, Theorem 3.9] for a start), and we will not pursue it here.
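The density parameter $m_H$ in (2) is easy to compute for small graphs. A minimal sketch follows (helper names are ours). It relies on the observation that, for a fixed vertex set, the induced subgraph has the most edges, so it suffices to scan induced subgraphs.

```python
from fractions import Fraction
from itertools import combinations


def max_density(num_vertices: int, edges: list) -> Fraction:
    """m_H = max over subgraphs K of H of e_K / v_K (vertices are 0..num_vertices-1)."""
    best = Fraction(0)
    for size in range(1, num_vertices + 1):
        for subset in combinations(range(num_vertices), size):
            chosen = set(subset)
            # Edges of the subgraph induced on the chosen vertices.
            e_k = sum(1 for u, v in edges if u in chosen and v in chosen)
            best = max(best, Fraction(e_k, size))
    return best


def complete_graph_edges(k: int) -> list:
    """Edge list of K_k on vertices 0..k-1."""
    return list(combinations(range(k), 2))
```

For $H = K_k$ this returns $(k-1)/2$, which matches the assumption $p \ge n^{-2/(k-1)}$ made for cliques.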

Janson and Ruciński [12] offer a nice overview of the methods used prior to 2002 to obtain upper bounds on the probability in (1), by far the more challenging part of the problem. To get an idea of the difficulty, note that even for the case that $H$ is a triangle, only quite poor upper bounds were known until a breakthrough result of Kim and Vu [15], who used, inter alia, the "polynomial concentration" machinery of [13] to show, for $p > n^{-1}\log n$,

\[ \exp_p[O_\eta(n^2p^2)] < \Pr(\xi_H > (1+\eta)\mathsf{E}\xi_H) < \exp[-\Omega_\eta(n^2p^2)], \tag{3} \]

where $\exp_p[x]$ stands for $p^x$. (The easy lower bound, seemingly first observed in [16], is, for example, the probability of containing a complete graph on something like $(1+\eta)^{1/3}np$ vertices. Of course the subscript $\eta$ in the lower bound is unnecessary if, for example, $\eta \le 1$, which is what we usually have in mind.)
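The arithmetic behind the easy lower bound can be sketched as follows. This is our own unpacking of the parenthetical remark, ignoring lower order terms and rounding, with $m$ an assumed name for the size of the planted clique.

```latex
% A fixed set of m vertices spans a clique with probability p^{\binom{m}{2}}.
\[
  \Pr\bigl(G(n,p) \supseteq K_m\bigr) \;\ge\; p^{\binom{m}{2}}
  \;=\; \exp\!\left[-\tbinom{m}{2}\log(1/p)\right],
  \qquad m \approx (1+\eta)^{1/3}\, np .
\]
% Such a clique alone already contains about (1+\eta) E\xi_{K_3} triangles,
% while \binom{m}{2} = O_\eta(n^2 p^2):
\[
  \binom{m}{3} \approx \frac{(1+\eta)\,n^3p^3}{6} \approx (1+\eta)\,\mathsf{E}\xi_{K_3},
  \qquad
  p^{\binom{m}{2}} = \exp\!\left[-O_\eta\!\left(n^2p^2\log(1/p)\right)\right].
\]
```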

The result of [15] was vastly extended in a beautiful paper of Janson, Oleszkiewicz and Ruciński [11], where it was shown that for any $H$ and $\eta$,

\[ \exp_p[O_{H,\eta}(M_H(n,p))] < \Pr(\xi_H > (1+\eta)\mathsf{E}\xi_H) < \exp[-\Omega_{H,\eta}(M_H(n,p))], \tag{4} \]

thus determining the probability (1) up to a factor $O(\log(1/p))$ in the exponent for constant $\eta$. A definition of $M_H$ is given in Section 10; for now we just mention that (for $p \ge n^{-2/(k-1)}$) $M_{K_k}(n,p) = n^2p^{k-1}$.

While it seems natural to expect that the lower bound in (4) is "usually" the truth (see Section 10 for a precise guess), the only progress in this direction until quite recently was achieved in [13], which established the upper bound $\exp[-\Omega(M_H(n,p)\log^{1/2}(1/p))]$ for $H = K_4$ or $C_4$ (the 4-cycle) and some values of $p$.

The $\log(1/p)$ gap was finally closed for the case $H = K_3$ by Chatterjee [5] and, independently, the present authors [6]. More precisely, [5] showed that for a suitable $C$ depending on $\eta$ and $p > Cn^{-1}\log n$,

\[ \Pr(\xi_{K_3} > (1+\eta)\mathsf{E}\xi_{K_3}) < p^{\Omega_\eta(n^2p^2)}, \]

while [6] showed, somewhat more generally, that for $p > n^{-1}$,

\[ \exp[-O_\eta(f(3,n,p))] < \Pr(\xi_{K_3} > (1+\eta)\mathsf{E}\xi_{K_3}) < \exp[-\Omega_\eta(f(3,n,p))], \]

where $f(k,n,p) := \min\{n^2p^{k-1}\log(1/p),\; n^kp^{\binom{k}{2}}\}$. (In what follows we will often abbreviate $f(k,n,p) = f(k,n)$.)

In this paper we considerably extend the method of [6] to settle the problem for general cliques and a bit more.

Theorem 1.1. Assume $H$ on $k$ vertices has minimum degree at least $k-2$ (that is, the complement of $H$ is a matching). Then for all $\eta > 0$ and $p \ge n^{-2/(k-1)}$,

\[ \Pr(\xi_H > (1+\eta)\mathsf{E}(\xi_H)) < \exp[-\Omega_{\eta,H}(f(k,n,p))]. \]

Theorem 1.2. For $H = K_k$ and for all $p \ge n^{-2/(k-1)}$,

\[ \Pr(\xi_H > 2\mathsf{E}(\xi_H)) > \exp[-O_H(f(k,n,p))]. \]

Remarks. 1. We are most interested in the "nonpathological" range where $f(k,n,p) = n^2p^{k-1}\log(1/p)$, so when $p > n^{-2/(k-1)}(\log n)^{2/[(k-1)(k-2)]}$ (or a bit less). It may be helpful to think mainly of this range as we proceed.

2. Though mainly concerned with the case $H = K_k$ in Theorem 1.1, we prove the more general statement for inductive reasons. For noncliques the bound of Theorem 1.1 is not usually tight; more precisely: it is tight (up to the constant in the exponent) if $p = \Omega(1)$ or if $\Delta := \Delta_H = k-1$ and $p = \Omega(n^{-1/\Delta})$, in which cases our upper bound agrees with the lower bound in (4); it is not tight if $\Delta = k-2$ and $p = o(1)$ (see the proof of Lemma 2.4) or if $H \ne K_k$ and $p < n^{-c/\Delta}$ for some fixed $c > 1$ (see the proof of Lemma 2.5 in Appendix B; in fact $p = o(n^{-1/\Delta})$ is probably enough here, which would complete this little story, but we don't quite show this).
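To get a feeling for the two regimes of $f(k,n,p)$ mentioned in Remark 1, one can simply evaluate both terms of the minimum. The sketch below uses our own helper names. `crossover_estimate` implements the rough boundary from Remark 1, which uses $\log n$ as a stand-in for $\log(1/p)$ and is meant for $k \ge 3$.

```python
import math


def f_terms(k: int, n: float, p: float):
    """Return the two quantities whose minimum is f(k, n, p)."""
    log_term = n ** 2 * p ** (k - 1) * math.log(1 / p)
    clique_term = n ** k * p ** math.comb(k, 2)
    return log_term, clique_term


def f(k: int, n: float, p: float) -> float:
    """f(k, n, p) = min{n^2 p^(k-1) log(1/p), n^k p^C(k,2)}."""
    return min(f_terms(k, n, p))


def crossover_estimate(k: int, n: float) -> float:
    """Rough boundary p = n^(-2/(k-1)) * (log n)^(2/((k-1)(k-2))) from Remark 1."""
    return n ** (-2 / (k - 1)) * math.log(n) ** (2 / ((k - 1) * (k - 2)))
```

Right at $p = n^{-2/(k-1)}$ the clique term $n^kp^{\binom{k}{2}}$ is the smaller one, while a constant factor above the crossover the term $n^2p^{k-1}\log(1/p)$ takes over.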

In the next section we show that Theorem 1.1 follows from an analogous assertion for $k$-partite graphs; most of the paper (Sections 3–8 and the two appendices) is then concerned with this modified problem. Section 9 gives the proof of Theorem 1.2 and Section 10 contains a few concluding remarks.

2 Reduction 

For the rest of the paper we set $t = \log(1/p)$ and take $H$ to be a graph with vertices $v_1, v_2, \ldots, v_k$. We define $G = G(n,p,H)$ to be the random graph with vertex set $V = V_1 \cup \cdots \cup V_k$, where the $V_i$'s are disjoint $n$-sets and $\Pr(xy \in E(G)) = p$ whenever $x \in V_i$ and $y \in V_j$ for some $v_iv_j \in E(H)$, these choices made independently. We define a copy of $H$ in $G$ to be a set of vertices $\{x_1, \ldots, x_k\}$ with $x_i \in V_i$ and $x_ix_j \in E(G)$ whenever $v_iv_j \in E(H)$; use $X_H^{n,p}$ for the number of such copies; and set $\Psi(H,n,p) = \mathsf{E}(X_H^{n,p}) = n^kp^{e(H)}$. When there is no danger of confusion we will often use $X_H$ (or, for typographical reasons, $X(H,n)$) for $X_H^{n,p}$ and $\Psi(H,n)$ for $\Psi(H,n,p)$.
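The $k$-partite model is simple to simulate for tiny parameters. The sketch below uses our own naming and 0-indexed parts (so part `i` plays the role of $V_{i+1}$). It samples $G(n,p,H)$, counts the copies $X_H^{n,p}$ by brute force, and evaluates $\Psi(H,n,p) = n^kp^{e(H)}$.

```python
import itertools
import random


def sample_partite_graph(n, p, h_edges, rng):
    """Sample G(n, p, H): for each edge (i, j) of H, every pair
    (x in part i, y in part j) is an edge independently with probability p.
    A vertex is represented as (part index, index within the part)."""
    edges = set()
    for i, j in h_edges:
        for x in range(n):
            for y in range(n):
                if rng.random() < p:
                    edges.add(((i, x), (j, y)))
    return edges


def count_partite_copies(n, k, h_edges, edges):
    """X_H: number of choices of one vertex per part such that the chosen
    vertices in parts i and j are adjacent whenever (i, j) is an edge of H."""
    return sum(
        all(((i, xs[i]), (j, xs[j])) in edges for i, j in h_edges)
        for xs in itertools.product(range(n), repeat=k)
    )


def psi(n, p, k, num_h_edges):
    """Psi(H, n, p) = n^k * p^e(H), the expectation of X_H."""
    return n ** k * p ** num_h_edges
```

With $p = 1$ every transversal is a copy, so the count equals $n^k$, in agreement with $\Psi(H,n,1)$.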

The next two propositions show an equivalence between $G(n,p)$ and $G$ with regard to upper tails for subgraph counts. In each we set $\alpha = |\mathrm{Aut}(H)|\,n^k/(kn)_k \sim k^{-k}|\mathrm{Aut}(H)|$ (where as usual $(a)_b = a(a-1)\cdots(a-b+1)$).



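Proposition 2.1. For $\eta > 0$ and $\varepsilon = \eta/(2+\eta)$,

\[ \Pr(X_H^{n,p} \ge (1+\varepsilon)\Psi(H,n,p)) \ge \frac{\alpha\varepsilon}{1-\alpha+\alpha\varepsilon}\,\Pr(\xi_H^{kn,p} \ge (1+\eta)\mathsf{E}(\xi_H^{kn,p})). \]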



Proposition 2.2. For any $\varepsilon > 0$ there is a $C = C_{\varepsilon,H}$ such that for $p > Cn^{-1/m_H}$,

\[ \Pr(X_H^{n,p} \ge (1+\varepsilon)\Psi(H,n,p)) < 2\Pr(\xi_H^{kn,p} \ge (1+\alpha\varepsilon/2)\mathsf{E}(\xi_H^{kn,p})). \]

(See (2) for $m_H$.) We omit the proof of Proposition 2.1 since it is a straightforward generalization of the case $H = K_3$ proved in [6]. Proposition 2.2 is proved in Appendix A.

According to Proposition 2.1, Theorem 1.1 will follow from the corresponding $k$-partite statement, viz.

Theorem 2.3. If $H$ has minimum degree at least $k-2$, then

(a) for all $\varepsilon > 0$,

\[ \Pr(X_H^{n,p} \ge (1+\varepsilon)\Psi(H,n,p)) < \exp[-\Omega_{H,\varepsilon}(f(k,n,p))]; \]

(b) for any $\tau \ge 1$,

\[ \Pr(X_H^{n,p} \ge 2\tau\Psi(H,n,p)) < \exp[-\Omega_H(f(k,n\tau^{1/k},p))]. \]

Note that (b) for a given $H$ follows from (a), since (noting that $\tau\Psi(H,n) = \Psi(H,n\tau^{1/k})$ and using (a) for the second inequality)

\[ \Pr(X_H^{n} \ge 2\tau\Psi(H,n)) \le \Pr\left(X_H^{n\tau^{1/k}} \ge 2\Psi(H,n\tau^{1/k})\right) < \exp\left[-\Omega_H(f(k,n\tau^{1/k},p))\right]. \]
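The scaling identity used in this reduction is immediate from $\Psi(H,n,p) = n^kp^{e(H)}$. The short derivation below spells it out, treating $n\tau^{1/k}$ as an integer for simplicity (our simplifying assumption).

```latex
\[
  \Psi(H, n\tau^{1/k}, p)
  = \bigl(n\tau^{1/k}\bigr)^{k} p^{e(H)}
  = \tau\, n^{k} p^{e(H)}
  = \tau\, \Psi(H,n,p).
\]
% Since \tau \ge 1, each part of G(n,p,H) can be viewed as a subset of the
% corresponding part of G(n\tau^{1/k},p,H), so the count with n vertices per
% part is at most the count with n\tau^{1/k} vertices per part; this gives
% the first inequality in the display above.
```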



We include (b) because it will be needed for induction; that is, for a given $H$ we just prove (a), occasionally appealing to earlier cases of (b).

We have formulated the theorem for all $p$ so that the inductive parts of the proof don't require checking that $p$ falls in some suitable range. Note, however, that for the proof we can assume (for our choice of positive constants $C$ and $c$ depending on $H$ and $\varepsilon$)

\[ p > Cn^{-2/(k-1)}, \tag{5} \]

since for smaller $p$ ($\ge n^{-1/m_H}$) the theorem is trivial, and

\[ p < c, \tag{6} \]

since above this the desired bound is given by (4). As detailed in the next two lemmas, (4), together with some auxiliary results from [11], also allows us to ignore certain other cases of Theorem 2.3(a).

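Lemma 2.4. If $\Delta_H \le k-2$ then

\[ \Pr(X_H^{n,p} \ge (1+\varepsilon)\Psi(H,n,p)) < p^{\Omega_{H,\varepsilon}(n^2p^{k-1})}. \]

Proof. By Proposition 2.2 it is enough to show

\[ \Pr(\xi_H^{kn,p} \ge (1+\alpha\varepsilon/2)\mathsf{E}\xi_H^{kn,p}) < p^{\Omega_{H,\varepsilon}(n^2p^{k-1})}; \tag{7} \]

but this follows from (4), which, since $M_H(n,p) \ge n^2p^{\Delta_H}$ (see [11, Lemma 6.2]), bounds the left side of (7) by

\[ \exp[-\Omega_{H,\varepsilon}(n^2p^{\Delta_H})] \le \exp[-\Omega_{H,\varepsilon}(n^2p^{k-1}t)]. \]

□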

Lemma 2.5. For any H ^ on k vertices and^ > 0, ifp < n~^^^'^^^^^~^\ 
then 

Pr(X^ > {l + e)^{H,n)) < p^^An^v'^-^) _ 
This is proved in Appendix B. 

3 Large deviations 

This section collects a few standardish large deviation basics that will be used throughout the paper. It's perhaps worth noting that these elementary inequalities are the only "machinery" we will need.

We use $B(m,\alpha)$ for a random variable with the binomial distribution $\mathrm{Bin}(m,\alpha)$. The next lemma, which is easily derived from [2, Theorem A.1.12] and [10, Theorem 2.1] respectively (for example), will be used repeatedly, eventually without explicit mention.

Lemma 3.1. There is a fixed $C > 0$ so that for any $K \ge 1+\lambda$, $m$ and $\alpha$,

\[ \Pr(B(m,\alpha) \ge Km\alpha) < \min\{(e/K)^{Km\alpha},\; \exp[-C\lambda^2Km\alpha]\}. \tag{8} \]

Remark. We may assume $Km\alpha \ge 1$. Thus, if $em\alpha^c \le 1$ then $e/K \le \alpha^{1-c}$ and the bound in (8) is at most $\alpha^{(1-c)Km\alpha}$.
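The first bound in (8) can be compared with exact binomial tails for small parameters. The sketch below (helper names ours) computes $\Pr(B(m,\alpha) \ge T)$ exactly and evaluates $(e/K)^{Km\alpha}$. It is a numerical illustration only, not part of the argument.

```python
import math


def binomial_upper_tail(m: int, alpha: float, threshold: int) -> float:
    """Exact Pr(B(m, alpha) >= threshold)."""
    return sum(
        math.comb(m, j) * alpha ** j * (1 - alpha) ** (m - j)
        for j in range(max(threshold, 0), m + 1)
    )


def first_bound(m: int, alpha: float, big_k: float) -> float:
    """(e/K)^(K*m*alpha), the first term of the minimum in (8)."""
    return (math.e / big_k) ** (big_k * m * alpha)
```

For example, with $m = 100$, $\alpha = 0.05$ and $K = 3$ one compares `binomial_upper_tail(100, 0.05, 15)` with `first_bound(100, 0.05, 3)`; the bound is far from sharp at this scale but has the right shape when $K$ is large.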

The next lemma, an immediate consequence of Lemma 3.1 (and the above Remark), will also be used repeatedly, usually following a preliminary application of Lemma 3.1 to justify the assumption $enq^c \le 1$.

Lemma 3.2. Fix $c < 1$ and assume $enq^c \le 1$. If $S \subseteq V_i$ is random with $\Pr(x \in S) \le q$ $\forall x \in V_i$, these events independent, then for any $T$,

\[ \Pr(|S| \ge T) < q^{(1-c)T}. \]
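One way to see how Lemma 3.2 follows from Lemma 3.1 and the Remark is sketched below. The sketch reads the conclusion of the lemma as $\Pr(|S| \ge T) < q^{(1-c)T}$ (the form suggested by the Remark) and writes $T = Knq$; both are our reading rather than a quotation.

```latex
% |S| is stochastically dominated by B(n,q).  We may assume T \ge 1, and
% enq^c \le 1 forces nq < 1, so K = T/(nq) > e and Lemma 3.1 applies:
\[
  \Pr(|S| \ge T) \;\le\; \Pr\bigl(B(n,q) \ge Knq\bigr)
  \;<\; (e/K)^{Knq} \;=\; \left(\frac{enq}{T}\right)^{T}.
\]
% Using T \ge 1 and enq = (enq^{c})\, q^{1-c} \le q^{1-c}:
\[
  \left(\frac{enq}{T}\right)^{T} \;\le\; (enq)^{T}
  \;\le\; \bigl(q^{1-c}\bigr)^{T} \;=\; q^{(1-c)T}.
\]
```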

We also need the following inequality, which is an easy consequence of, 
for example, O Lemma 8.2]. 

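Lemma 3.3. Suppose $w_1, \ldots, w_m \in [0,z]$. Let $\xi_1, \ldots, \xi_m$ be independent Bernoullis, $\xi = \sum \xi_iw_i$, and $\mathsf{E}\xi = \mu$. Then for any $\eta > 0$ and $\lambda \ge \eta\mu$,

\[ \Pr(\xi \ge \mu + \lambda) < \exp[-\Omega_\eta(\lambda/z)]. \]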

4 Outline 

In this section we list the steps in the proof of Theorem 2.3(a), filling in some definitions as we go along. The proof proceeds by induction on (say) $k^2 + e_H$, so that in proving the statement for $H$ we may assume its truth for all graphs with either fewer than $k$ vertices or with $k$ vertices and fewer than $e_H$ edges. The case $k = 2$ is trivial and $k = 3$ is the main result of [6], so we assume throughout that $k \ge 4$.

Most of the proof (Lemmas 4.1–4.3) consists of identifying certain anomalies, for example vertices of unusually high degree, and bounding the number of copies of $H$ in which they appear. The remaining copies are then easily handled (in Lemma 4.4) using Lemma 3.3.

Here and throughout we use $C$ and $C_\varepsilon$ for (positive) constants depending on (respectively) $H$ and $(H,\varepsilon)$, different occurrences of which will usually denote different values. Similarly, we use $\Omega$ and $\Omega_\varepsilon$ as shorthand for $\Omega_H$ and $\Omega_{H,\varepsilon}$. We say an event $E$ occurs with large probability (w.l.p.) if $\Pr(E) > 1 - \exp[-\Omega_\varepsilon(n^2p^{k-1}t)]$, and write "$\alpha <^* \beta$" for "w.l.p. $\alpha < \beta$" (where $\varepsilon$ is as in the statement of the theorem). Note that (5) (with a suitable $C$) guarantees that an intersection of, for example, $n$ w.l.p. events is itself a w.l.p. event, a fact we will sometimes use without mention in what follows.

By Lemma 2.4 we may assume $\Delta_H = k-1$. We reorder the vertices of $H$ so that $k-1 = d(v_1) \ge d(v_2) \ge \cdots \ge d(v_k)$ and if $d(v_2) = k-2$ then $v_2 \not\sim v_3$. We set $A = V_1$, $B = V_2$, $C = V_3$ and always take $a$, $b$ and $c$ to be elements of $A$, $B$ and $C$ respectively. For disjoint $X, Y \subseteq V$ we use $\nabla(X,Y)$ for the set of edges with one end in each of $X$ and $Y$, and $\nabla(X)$ for the set of edges with one end in $X$. We use $N(x)$ for the neighborhood of (set of vertices adjacent to) a vertex $x$.

For $K \subseteq H$ with vertex set $\{v_i : i \in T\}$ ($T \subseteq [k]$), define a copy of $K$ in $G$ ($= G(n,p,H)$) to be a set of vertices $\{x_i : i \in T\}$ with $x_i \in V_i$ and $x_ix_j \in E(G)$ whenever $v_iv_j \in E(K)$. For $x_1, x_2, \ldots, x_l$ vertices belonging to distinct $V_i$'s we use $w_K(x_1, \ldots, x_l)$ for the number of copies of $K$ containing $x_1, \ldots, x_l$; when $K = H$ we call this the weight of $\{x_1, \ldots, x_l\}$. We use $H_S = H - \{v_i : i \in S\}$ ($S \subseteq [k]$), and abbreviate $H_{\{i\}} = H_i$, $w_{H_S}(\cdot) = w_S(\cdot)$ and $w_\emptyset(\cdot)$ ($= w_H(\cdot)$) $= w(\cdot)$.

Set = me and define 5 by (1 + 5)^ = 2. For x G and i G [A;], let 
$d_i(x) = |N(x) \cap V_i|$, and set $d(x) = \max\{d_i(x) : i \in [k]\}$. Say a vertex $x$ is high degree if $d(x) > (1+\delta)np$, and a copy of $H$ is type one if it contains a high degree vertex from $A$, $B$ or $C$.

Lemma 4.1. W.l.p. $G$ contains less than $7\vartheta\Psi(H,n)$ type one copies of $H$.

Let $A'$, $B'$, $C'$ denote the subsets of $A$, $B$, $C$ respectively of vertices which are not high degree. For vertices $x, y \in G$ let $d_j(x,y) = |N(x) \cap N(y) \cap V_j|$ and $d(x,y) = \max_j d_j(x,y)$. A pair of vertices $(x,y)$ is high degree if $d(x,y) > np^{3/2}$. For $k > 4$ a copy of $H$ is type two if it contains a high degree pair $(x,y)$ belonging to either $A' \times C'$ or $B' \times C'$; for $k = 4$ we don't need this, and simply declare that there are no copies of type two.

Lemma 4.2. W.l.p. $G$ contains less than $2\vartheta\Psi(H,n)$ type two copies of $H$.

Set $s = \min\{t,\; n^{k-2}p^{\binom{k-1}{2}}\}$, the two regimes corresponding to the two ranges of $f(k,n,p)$ ($= n^2p^{k-1}s$). Define $w^*(\cdot)$ in the same way as $w(\cdot)$, but with the count restricted to copies of $H$ that are not type one or two. Set

( 2,^-'^-^{H,n,p)/{v?p^-^s) iik>b 
^ ~ \ 22b^{H,n,p)/{n'^p^s) if A: = 4 ^ ' 

and (in either case) say $ab \in \nabla(A,B)$ is heavy if $w^*(a,b) > \zeta$. Finally, say a copy of $H$ is type three if it is not type one or two and contains a heavy edge, and type four if it is not type one, two or three.

Lemma 4.3. W.l.p. $G$ contains less than $4\vartheta\Psi(H,n)$ type three copies of $H$.

Lemma 4.4. With probability at least $1 - \exp[-\Omega_\varepsilon(f(k,n,p))]$, $G$ contains less than $(1+2\vartheta)\Psi(H,n)$ type four copies of $H$.

Of course Theorem 2.3(a) (for $k \ge 4$) follows from Lemmas 4.1–4.4; these are proved in the next four sections.

5 Proof of Lemma 4.1

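For $i \in [3]$ set $D_1(i) = \{x \in V_i : d(x) > np^{2/5}\}$ and $D_2(i) = \{x \in V_i : np^{2/5} \ge d(x) > (1+\delta)np\}$, and for $j \in [2]$ set $S_j(i) = \sum\{d(x) : x \in D_j(i)\}$. We will show

Proposition 5.1. For all $1 \le i \le 3$,

\[ \text{w.l.p.}\ \ \forall x \in D_j(i), \quad w(x)/d(x) \le \begin{cases} 2n^{k-2}p^{e(H)-(k-1)} & \text{if } j = 1, \\ 2n^{k-2}p^{e(H)-k+2(k-1)/5} & \text{if } j = 2 \end{cases} \]

and

Proposition 5.2. For all $1 \le i \le 3$,

\[ S_j(i) <^* \begin{cases} \vartheta n^2p^{k-1} & \text{if } j = 1, \\ kn^2p^{k-1}t & \text{if } j = 2. \end{cases} \tag{10} \]

The lemma follows since the number of type one copies of $H$ is at most

\[ \sum_{x \text{ high degree}} w(x) <^* \sum_{i=1}^{3}\left(S_1(i) \cdot 2n^{k-2}p^{e(H)-(k-1)} + S_2(i) \cdot 2n^{k-2}p^{e(H)-k+2(k-1)/5}\right) \]
\[ <^* 3\left(2\vartheta\Psi(H,n) + 2k\Psi(H,n)p^{2(k-1)/5-1}t\right) < 7\vartheta\Psi(H,n), \]

using Propositions 5.1 and 5.2 for the first and second inequalities. □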

Proof of Proposition 5.1. Fix $i$ and condition on $\nabla(V_i)$ (thus determining $D_1(i)$ and $D_2(i)$). If $d(v_i) = k-1$, then for any $x \in D_1(i)$, induction gives

\[ \Pr(w(x) > 2\Psi(H_i,d(x))) < \exp[-\Omega(f(k-1,d(x)))], \]

whence (noting $\Psi(H_i,\cdot) = \Psi(H_1,\cdot)$)

\[ \Pr(\exists x \in D_1(i) : w(x) > 2\Psi(H_i,d(x))) < n\exp[-\Omega(f(k-1,d(x)))] < p^{\Omega(n^2p^{k-1})}. \tag{11} \]

Similarly,

\[ \Pr(\exists x \in D_2(i) : w(x) > 2\Psi(H_i,np^{2/5})) < n\Pr\left(X_{H_i}^{np^{2/5}} > 2\Psi(H_i,np^{2/5})\right) < n\exp[-\Omega(f(k-1,np^{2/5}))] < p^{\Omega(n^2p^{k-1})}. \tag{12} \]

Note that, here and throughout, we omit the routine verifications of inequalities like those in the last lines of (11) and (12).

If d{vi) = k — 2, then Vi ^ Vj for some j G [A;]. We partition Vj = 
Pi U • • • U P[i/p] with each of size at most (1 + 6)np, and write w^(x) for 
the number of copies of H containing x and meeting P^. Noting that here 
^{Hi, •) = p~^^{Hi, •) (and w(x) = Y^^v/ix)), we have 

Pr (w(x) > 2^(/7i, d(x))) < Pr(3^ w^(x) > 2^(Fi, (i(x))) 

<p-'exp [-n{f{k-l,d{x)))] 

for a given x, so that 

Pr (Bx G P»i(z) : w(x) > 2^^ (Fi, (i(x))) < np'^ exp [-f^(/(A; - 1, d(x)))] 

<p"'*''^\ (13) 

and 

Pr(3x G P)2(i) : w(x) > 2*(i/i, n//^)) < Pr(X]^f > 2^'(/?i, n//^)) 

< exp[-n{f{k - 1, n//^))] 
<p"'f'"\ (14) 



Finally, ([TT])-([T1D imply that w.l.p. 

w{x)/d{x) < 2^{Hi,d{x))/d{x) = 2{d{x)f-^p''^^^-^^-^^ /d{x) 

<2n^-'^p<^^-^^'^^ yx^Di{i) 

and 

j{x)/d{x) < 2^{Hi,np^l^)/d{x) = 2(np2/5)'=-yW-(fc-i)/d(x) 

< 2n^-''p<^^-^+^^^'^^'^ Vx G 1)2 (i). 



Proof of Proposition 5.2. We bound $|\nabla(D_j(i))|$, which is, of course, an upper bound on $S_j(i)$. We first assert that, for any $i \in [3]$, w.l.p.

\[ |D_1(i)| < \vartheta np^{k-7/5} \quad \text{and} \quad |D_2(i)| < np^{k-2}t. \tag{15} \]

This will follow from Lemmas 3.1 and 3.2 (so really two applications of Lemma 3.1), a combination we will see repeatedly. For a given $i$ and $j$ the events $\{x \in D_j(i)\}$ ($x \in V_i$) are independent with (using Lemma 3.1)

\[ \Pr(x \in D_1(i)) < k\Pr(B(n,p) > np^{2/5}) < k(ep^{3/5})^{np^{2/5}} < p^{0.5np^{2/5}} \]

and

\[ \Pr(x \in D_2(i)) < k\Pr(B(n,p) > (1+\delta)np) < \exp[-\Omega(np)]. \]

An application of Lemma 3.2 now shows that (15) holds w.l.p.



Assume then that (15) holds, and for convenience rename its bounds $\vartheta np^{k-7/5} = r$ and $np^{k-2}t = u$; we may of course assume $r \ge 1$ if proving the first bound in (10) and $u \ge 1$ if proving the second. We have (a bit crudely)

\[ \Pr(|\nabla(D_1(i))| > \vartheta n^2p^{k-1}) < \Pr\left(\exists T \in \tbinom{V_i}{r} : |\nabla(T)| > \vartheta n^2p^{k-1}\right) \]
\[ < \tbinom{n}{r}\Pr(B((k-1)rn,p) > \vartheta n^2p^{k-1}) < n^r\left(e(k-1)p^{3/5}\right)^{\vartheta n^2p^{k-1}} \]

and

\[ \Pr(|\nabla(D_2(i))| > kn^2p^{k-1}t) < \Pr\left(\exists T \in \tbinom{V_i}{u} : |\nabla(T)| > kn^2p^{k-1}t\right) \]
\[ < \tbinom{n}{u}\Pr(B((k-1)un,p) > kn^2p^{k-1}t) < n^u\exp[-\Omega(n^2p^{k-1}t)], \]

with the third inequality in each case given by Lemma 3.1.






6 Proof of Lemma 4.2

(Here we are only interested in $k \ge 5$.) We bound the contribution of high-degree $(A',C')$-pairs, the argument for $(B',C')$-pairs being similar.

Let $A''$ be the (random) set of vertices of $A'$ involved in high-degree $(A',C')$-pairs, that is, $A'' = \{a \in A' : \exists c \in C',\ d(a,c) > np^{3/2}\}$, and define $C''$ similarly. We will show that

\[ \text{w.l.p.}\ \ |A''|, |C''| < np^{k-5/2} \tag{16} \]

and

\[ \text{w.l.p.}\ \ w(a,c) < 2t\Psi(H_{\{1,3\}},(1+\delta)np) \quad \forall (a,c) \in A' \times C'. \tag{17} \]

Combining these we find that the total weight of high degree $(A',C')$-pairs is w.l.p. at most

\[ (np^{k-5/2})^2 \cdot 2t\Psi(H_{\{1,3\}},(1+\delta)np) < 4n^2p^{3k-7}t\,\Psi(H_{\{1,3\}},n) < \vartheta\Psi(H,n), \]

where the second inequality uses $\Psi(H_{\{1,3\}},n) \le n^{-2}p^{-(2k-3)}\Psi(H,n)$ and $4p^{k-4}t < \vartheta$ (see (6)). Since, as noted above, the same argument shows that the contribution of high-degree $(B',C')$-pairs is w.l.p. at most $\vartheta\Psi(H,n)$, the lemma follows.

Proof of (16). Given $\nabla(C)$, the events $\{a \in A''\}$ are independent, with

\[ \Pr(a \in A'') < n(k-2)\Pr\left[B((1+\delta)np,p) > np^{3/2}\right] < n(k-2)\left(e(1+\delta)p^{1/2}\right)^{np^{3/2}} < p^{0.4np^{3/2}} =: q, \]

where we use (5), (6) and $k \ge 5$ for the last inequality. Thus, since $enq^c \le 1$ for a suitable fixed $c < 1$, Lemma 3.2 gives (16) for $A''$, and of course the same argument applies to $C''$. □

Proof of (|17p . Here we have lots of room and just bound max{w3(a) : 
o E A'}, a trivial upper bound on max{w(a, c) : a E A',c E C"}. Since 
d{a) <{l + d)np (for a E A') and vi ~ E [k] \ {2, 3}, Theorem [231^b) 

gives (inductively) 

Prpa E A' W3(a) > 2m{H{i^s}, (1 + S)np)] 

< nexp[-n{f{k - 2, (1 + 6)nptT^))] < pf^("V~^) 

(with verification of the second inequality, which does need (5) at one point, again left to the reader). □

7 Proof of Lemma 4.3



This requires special treatment when $k = 4$; see the beginning of Section 7.2 for the reason for the split. In Sections 7.1 and 7.2 we set $A'' = \{a : d_i(a) \le (1+\delta)np\ \ \forall i \ge 3\} \supseteq A'$ and define $B''$ similarly.

7.1 Proof for $k \ge 5$

For reasons that will be explained as we proceed, we need somewhat different arguments for large and small values of $p$.

Case 1: np^^'^^/^ > log^ n. Let Cf, = {c G C n Nib) : d{b, c) < 7ip^/^} and 

W{A) = {a:3beB", ^ wi(6, c) > C} 5 {a : 36, w*(a, 6) > C} 

ceCtr\N{a) 

(see dHl for (), and define W{B) similarly. 

Remark. While it may seem more natural to define $W(B)$ in terms of $w(a,b)$ or $w^*(a,b)$, the present definition has the advantage of not depending on $\nabla(W(A),W(B))$. We will see something similar in Case 2.

The point requiring most work here is 

w.l.p. \W{A)\,\W{B)\ Ki^np^'^-^y^t^. (18) 

Given this, the rest of the argument goes as follows. According to Lemma 
EH (HID implies 

w.l.p. |V(Ty(^),P^(S))| < t??iV"^ (19) 

(since, given the inequality in ([TH|), | V(VF(^), VF(i?))| ~ B{m,p) for some 
m < 'd'^n'^p^~^t^; note the inequalities in ([18]) and ^T9\i depend on separate 
sets of random edges). On the other hand, an inductive application of 
Theorem 12. 3f b) gives 

w.l.p. w*(a,6) < 2^(i?|i^2},(l + 5)f^p) Va,6 (20) 

(using the fact that we are in Case 1 and noting that d{a) > (1 + 6)np 
implies w*(o, 6) = 0). 

Finally, the combination of (19) and (20) bounds the number of type three copies of $H$ by $\vartheta n^2p^{k-1} \cdot 2\Psi(H_{\{1,2\}},(1+\delta)np) < 4\vartheta\Psi(H,n)$. □

Proofs of the two assertions in (18) being similar, we just deal with $W(A)$. We first show

w.l.p. wi(6, c) < 2tn^-^p''"-^^^-^'il'^ =: 7 V6 G B" and c E (21) 
and 

w.l.p. wi(6) < 4n^-V-^"^^"^^ \/beB". (22) 

These will imply, via Lemma |3.3| that the events {a E P^(74)} are unlikely, 
and then (jlSp will be an application of Lemma 13.21 

Each of ()2ip and ()22p is given (inductively) by Theorem 12.3( b). with 
small differences in arithmetic depending on d{v2) and d{v3): say we are in 
(a),(b) or (c) according to whether {d{v2) , d{v^)) is (/c — 1, /c — 1), {k—\,k — 2) 
or {k -2,k- 2). 

For ([2T]) we first observe that, given \7{B U C) and c E C;,, wi(6, c) is 
stochastically dominated by X := X(ff|| 2,3}; ^P^^^) ™ (a) (c), and by 
the sum of [1/pJ independent copies of X in (b). (For the latter assertion, 
let i be the index for which vs ^ ve and, recalling that b E B", partition 
N{b) n = Vi U • • • U V^[i/pj with each block of size at most np^^'^.) Theorem 
12.3( b) thus gives the upper bound 

n\l/p\ exp[-n{f{k - 3,7i//2ii/(fc-3))] < p^CnV-^) (23) 

on either 

Pr(36 E B",ce Ct : wi(6,c) > 2m{H{^^2,3},np^/^)) 
(if we are in (a) or (c)) or 

Pr(36 E S",cE a : wi(6,c) > 2t[l/p\^{H^^^2,3},np^/^)) 

(if we are in (b)), the inequality in ()23p holding because we are in Case 1. 
(Note that in (p3|) the [l/p\ is needed only when we are "in (b)," and the 
term involving t only when k = 5.) 

To complete the proof of ()2ip it just remains to check that 7 (recall this 
is the right hand side of (I2ip ) is an upper bound on 2t"^{H^i 2^3},np^^^) if 
we are in (a) or (c), and on 2t[l/p\^{H^i 2^3},np^^'^) if we are in (b). I 

The proof of (22) is similar. Here, because we are in Case 1, Theorem 2.3(b) gives the bound

\[ n\lfloor 1/p \rfloor \exp[-\Omega(f(k-2,(1+\delta)np))] < \exp_p[\Omega(n^2p^{k-1})] \]

on $\Pr(\exists b \in B'' : w_1(b) > 2\Psi(H_{\{1,2\}},(1+\delta)np))$ if we are in (a) or (b), and on $\Pr(\exists b \in B'' : w_1(b) > 2\lfloor 1/p \rfloor\Psi(H_{\{1,2\}},(1+\delta)np))$ if we are in (c); and it's easy to check that $2\Psi(H_{\{1,2\}},(1+\delta)np)$ or $2\lfloor 1/p \rfloor\Psi(H_{\{1,2\}},(1+\delta)np)$ (as appropriate) is less than $4n^{k-2}p^{e(H)-(k-1)}$. □

Finally we return to (jlSp . Fix (and condition on) any value of E{G) \ 
V(A, C) satisfying the inequalities in (|2ip and (|22p . It is enough to show 
that, under this conditioning and for any a, 

Pr(a G W{A)) < exp[-0(np(^-^)/V*^)] =: 9, (24) 

since then Lemma 13.21 implies, using enq^^^ < 1 and the fact that the events 
{a G VF(^)} are independent, 

\W{A)\ <* T?np('=-^)/2^l 

(The assertion enq^^^ < 1 (or enq"^ < 1) imposes the most stringent require- 
ment on p for Case 1.) 

For (I24p we observe that (I22p gives (for any a and b & B') 

E( ^ wi(6,c))=p J^wi(6,c)<pwi(6)<4n'=-V«-'=+2<C/2, 

whence, using Lemma 13.31 with ()2ip . we have 

Pr(a G VF(^)) < Pr(36 G B" ^ wi(6, c) > C) 

cGCi,nAf(a) 

< nexp[— ri(C/7)] < nexp[— 

< exp[-J7(np(^"i)/Vi^)]- 



Case 2: np^^~^^^'^ < log^ n. Recall that for very small p — in particular for 
p in the present range — and H ^ K^, Theorem 12.31 is contained in Lemma 
12. 5t we may thus assume H = K^. Let H' = H — viV2 and, writing w' for 
wh', set 

W{A) = {a:3be B",W{a,b) > C} 5 {a : 3b \N*{a,b) > (}, (25) 

and define $W(B)$ similarly. (We could also work directly with $w(a,b)$ and avoid the extra definitions; but the present treatment, which we will see again below, is more natural in that it allows us to ignore the essentially irrelevant $\nabla(A,B)$.)

The argument here is similar to that for Case 1. We again show that 
membership in VF(^), W{B) is unlikely, leading to 

w.l.p. \W{A)\,\W{B)\ <log^n, (26) 

which, in view of Lemma |3.H again gives 

w.l.p. \V{W{A),W{B))\ K^n'^p''"^. (27) 

On the other hand we will show, by an argument somewhat different 
from others seen here, 

w.l.p. w*(a,6) < n'="2p('"^') Va,6. (28) 

Combining this with (j27p gives Proposition 14.31 (for the present case). 

Proof of (|26p . Of course it's enough to prove the assertion for Pt^(A). We 
first observe that 

w.l.p. wi(6) < 2m{H{^^2}, (1 + S)np) < 4tlog^'=-^n =: m V6 G B"; (29) 

as elsewhere, this is given by an inductive application of Theorem 12.3( b). 
which says that, for any b G B", 

Pr(wi(6) > 2t^{H{i^2}, (1 + S)np)) < exp[-n{f{k - 2, (1 + 6)npt^/'^''-^^))] 

(Note that for very small p the extra factor t in (|29p — which did not appear 
in (j22p — is needed for the final inequality here.) 

We now condition on E{G) \ V{A) and assume that, as in ([29|) . wi(6) < 
m\/b £ B" . Note that a G VF(^) means (at least) that there is some b G B" 
with 

W{a,b)>3''^^. (30) 

For $i \in \{3, \ldots, k\}$ (and any $b$), let $V_i^*(b)$ be the set of vertices of $V_i$ lying on copies of $H_1$ that contain $b$. Since

\[ w'(a,b) \le \prod_{i=3}^{k} |N(a) \cap V_i^*(b)|, \]

(30) at least requires $|N(a) \cap (\bigcup_{i=3}^{k} V_i^*(b))| > 3(k-2)$; so the probability (for a given $a$) that there is some $b$ for which (30) holds is at most

\[ n\Pr(B((k-2)m,p) > 3(k-2)) < p^{-(k-1)/2+(1-o(1))3(k-2)} < p^{k-1} =: q. \]

But then, since (say) $enq^c \le 1$ for a suitable fixed $c < 1$, Lemma 3.2 gives (26).



Remark. Of course (28) is the counterpart of (20) of Case 1 (since $H$ is now $K_k$ the two bounds differ only by small constant factors); but for very small $p$ the simple inductive derivation of (20) using Theorem 2.3(b) no longer applies, since $f(k-2,(1+\delta)np)$ may be much smaller than $f(k,n)$.

Proof of (28). We may assume $b \in B'$ as otherwise $w^*(a,b) = 0$. For $i \in \{3, \ldots, k\}$ let

\[ V_i^*(a,b) = \{v \in V_i : \text{some copy of } H \text{ on } a, b \text{ contains } v\}. \]

We will show that

\[ \text{w.l.p.}\ \ |\nabla(V_i^*(a,b),V_j^*(a,b))| < n^2p^{k-1} \quad \forall i, j, a \text{ and } b \in B'. \tag{31} \]

That this gives (28) is essentially a special case of a theorem of N. Alon [1], the precise statement used here (see the proof of Theorem 1.1 in [7]) being: an $r$-partite graph with at most $\ell$ edges between any two of its parts contains at most $\ell^{r/2}$ copies of $K_r$.
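The extremal statement quoted above (an $r$-partite graph with at most $\ell$ edges between any two parts has at most $\ell^{r/2}$ copies of $K_r$) can be checked by brute force on small examples. The following sketch uses our own helper names and is only an illustration of the statement, not of its proof.

```python
import itertools


def count_transversal_cliques(parts, edges):
    """Count copies of K_r that use exactly one vertex from each of the r parts.
    In an r-partite graph these are all the copies of K_r."""
    def adjacent(u, v):
        return (u, v) in edges or (v, u) in edges

    return sum(
        all(adjacent(u, v) for u, v in itertools.combinations(choice, 2))
        for choice in itertools.product(*parts)
    )


def max_edges_between_parts(parts, edges):
    """The smallest valid l: the largest number of edges between any two parts."""
    best = 0
    for a, b in itertools.combinations(parts, 2):
        in_a, in_b = set(a), set(b)
        count = sum(1 for u, v in edges
                    if (u in in_a and v in in_b) or (u in in_b and v in in_a))
        best = max(best, count)
    return best
```

For the complete 3-partite graph with parts of size 2 we get $\ell = 4$ and $8 = 4^{3/2}$ triangles, so the bound is attained there.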

For the proof of (j3ip we fix a, b and i < j, and think of choosing edges 
of G in the order: (i) V(6, V3 U • • • U Vfc); (ii) V(Va, Vp) for all 3 < a < /3 < /c 
except {a, P) = (iii) V{a,ViUVj); (iv) V{Vi,Vj). (The remaining edges 

are irrelevant here.) 

Let H" = Hi — ViVj. Since b £ B', Lemma 12.51 gives (since we are in 
Case 2) 

\NH"{b) <* 2^'(ifi,2 - ViVj, (1 + S)np) =: m. (32) 

Let V* be the set of vertices of Vi contained in copies of H" that contain b, 
and define V^* similarly. 

If the bound in (132p holds, then each of VJ*,Vj* has size at most m < 
p~^ log*^^^^ n; an application of Lemma 13.11 thus shows that w.l.p. each of 
N{a) n V*, N{a) n V* (and thus also V*{a,b), V*{a,b)) has size at most 
(say) p~^^^, and a second application gives ()3ip . 



7.2 Proof for $k = 4$

For $k = 4$, as in Case 2 above, we can't simply invoke induction to obtain (20), since $f(2,(1+\delta)np)$ ($\approx n^2p^3$) is smaller than $f(4,n)$. This is the main reason a separate argument is needed for $k = 4$.

Proof. We consider the possibilities $H = K_4$ and $H = K_4^-$ ($K_4$ with an edge removed) separately.

Case 1. H = K4. Now ab is heavy if w*(a, 6) > 22h-n?p^/s. Here it will 
be helpful to work with w rather than w*. We treat (heavy) a6's with 
w(a, h) > n?p^ and those with w(a, , n'^p'^] separately. 

To bound the contribution of edges of the first type, set 

A* = {a:3be B" , w'{a, b) > n V} D {a : 36 G B' , w(a, b) > nV} 

(where w' is as in the paragraph containing (|25p ). and define B* similarly. 
We first show 

w.l.p. \A*\,\B*\ <np'^/^. (33) 

To see this (for A*, say) we condition on the value of V{B,C U V4) and 
consider Pr(a G A*). Noting that for any a and b E B" , 

Pr(w'(a, b) > n^p^) < Pr((i(a,6) > n//^)+Pr( w'(a, b) > ii^p^ \d{a,b) < np^/^) 

(where 5/4 is just a convenient value between 1 and 3/2), we have 

Pr (a G A*) < n[2Pr(5((l + 6)np,p) > np^l^) + Pr(B(nV^^P) > "-V)] 

^ ^Q. {np-/i ) _^ ^Q. (n2p3 ) ^ (^^^^ 

Since (given V(i?, CUV4)) the events {a G ^*} are independent, Lemma [3.2l 
now gives (p3|) . (Note that by (l5|) the second term in (p^ is less than (say) 
n~^; thus when this term dominates (j34p . Lemma 13.21 gives A* = w.l.p.) 
On the other hand, again using Lemma |3.H we have 

Pr(3a G ^,6 G 5' : w(a,6) > nV*) < Pr(5((l + (5)^nV,p) > "-V*) 

and combining this with (I33p gives 

^{w*(a,6) : w(a,6) > ?iV} <* \A*\\B*\r?p^t <* n^p^'h (< i?nV)- 

For a6 of the second type (i.e. with w(a,6) G {225n'^p^ / s,n'^p^]), we 
take J = 15np^/^/^, set = {a : 36 G B",d{a,b) > J}, and define 
similarly. Given ^{B^C U V4) the events {a G Aj} are independent with, 
for each a, 

Pr(a G Aj) < 2nPr(B((l + 5)np,p) > J) < 2np^^~°^^'>^-^/^ =: q. 
(using e(l + 5)np^^'^'^°^^^ < J for the second inequality). Since enq^/'^ < 1 
(to see this, note J is always at least 15, and is n^^^^ if p > n"^/'^"''^^^)). 
Lemma 13.21 gives 

\Aj\ <* yfdn^p^jJ. 

Of course an identical discussion applies to so we have <* 

'ds-n?'p^ and, by Lemma [3?T| 

\V{Aj,Bj)\ <* ^nV. 

Thus, finally, 

^^{w*(a, 6) : ab heavy, w(a,6) G (n^p^/s, n^p^]} 
<* |V(^j,5j)|nV =i?^V 



Case 2: H = . Recall that ^3^4 is the missing edge and an edge ab is 
heavy if w*(a,6) > 22^^ {H,n,p)/{v?p^s) = 225n^p^/s. We proceed more 
or less as in the second part of Case 1. 

Set J = Ihnp/y/s, = {a : 36 G B",d{a,b) > J} and Bj = {b : 3a £ 
A",d{a,b) > J}. Given V{B, C U V4) the events {a £ Aj} are independent 
with, for each a, 

Pr(a G Aj) < 2nPr(B((l + 6)np,p) > J) < 2np-^/'^ < p'^^^ =: q 

(using Lemma [3. II and J > ep~^^^{l+6)np'^ for the second inequality). Since 
(say) enq^^^ < 1, Lemma [32] gives 

\Aj\ <* nV/J, 

and similarly for Bj. Since ab heavy at least requires a G Aj,b G Bj and 
a £ A' (and since a £ A' implies w(a, 6) < ((1 + 6)np)'^), this says that the 
number of type three copies of H is at most 

|^j||Sj|((l + 6)npf <* (nV/J)2((l + 5)npf < i9nV 

8 Proof of Lemma 4.4



As earlier, set $H' = H - v_1v_2$ and $w' = w_{H'}$. Let

\[ Z = \sum_{a \in A,\, b \in B} w'(a,b) <^* (1+\vartheta)\Psi(H',n) = (1+\vartheta)\Psi(H,n)/p, \tag{35} \]

where the inequality is given by induction if $d(v_2) = k-1$ and by Lemma 2.4 if $d(v_2) = k-2$. Let $w''(a,b) = \sum w'(a,b,c)$, with the sum over

\[ \{c \in N(a) \cap N(b) : d(a,c), d(b,c) \le np^{3/2},\ d(c) \le (1+\delta)np\}, \]

and note $w''(a,b) = w^*(a,b)$ if $ab \in E(G)$ and $a$, $b$ are not high degree, and otherwise $w^*(a,b) = 0$.

Thus

\[ Y := \sum\{w''(a,b)\mathbf{1}_{\{ab \in E(G)\}} : w''(a,b) \le \zeta\} \ge \sum\{w^*(a,b) : w^*(a,b) \le \zeta\}. \]

In view of (35) it's enough to show that under any conditioning on $E(G) \setminus \nabla(A,B)$ for which $Z \le (1+\vartheta)\Psi(H,n)/p$,

\[ \Pr(Y \ge (1+2\vartheta)\Psi(H,n)) < \exp[-\Omega_\vartheta(n^2p^{k-1}s)] \quad (= \exp[-\Omega_\varepsilon(f(k,n,p))]). \]

But under any such conditioning (or any conditioning on $E(G) \setminus \nabla(A,B)$), the r.v.'s $\mathbf{1}_{\{ab \in E(G)\}}$ are independent; so, noting $\mathsf{E}Y \le pZ \le (1+\vartheta)\Psi(H,n)$ and using Lemma 3.3, we have

\[ \Pr(Y \ge (1+2\vartheta)\Psi(H,n)) < \exp[-\Omega(\vartheta\Psi(H,n)/\zeta)] = \exp[-\Omega_\vartheta(n^2p^{k-1}s)]. \]



9 Proof of Theorem 1.2

Recall here $H = K_k$. Set $r = \lceil 2\binom{n}{k}p^{\binom{k}{2}} \rceil$. Note that we only need to prove Theorem 1.2 for small $p$, for simplicity say $p < n^{-2/(k-1)}\log n$, since above this $f(k,n,p) = n^2p^{k-1}t$ and the theorem is given by the lower bound in (4). It will thus be enough to show

Proposition 9.1. For $n^{-2/(k-1)} \le p \le n^{-2/(k-1)}\log n$,

\[ \Pr(\xi_H = r) > \exp[-O(r)]. \]

Proof. (This is an easy generalization of the argument for $k = 3$ given in [6].) The number of sets $S$ of $r$ vertex-disjoint copies of $H$ in $K_n$ is

^ ._ {n)rk ^ ( (oa\ 
' r\{kiy ^ [rk^ J ■ 
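The count appearing in (36), as we read it, is $(n)_{rk}/(r!\,(k!)^r)$: choose an ordered list of $rk$ distinct vertices, cut it into $r$ blocks of size $k$, and divide out the orderings within blocks and of the blocks themselves. A small sketch (function name ours) makes this easy to check against known values.

```python
import math


def disjoint_clique_families(n: int, k: int, r: int) -> int:
    """Number of sets of r vertex-disjoint copies of K_k in K_n,
    namely (n)_{rk} / (r! * (k!)^r)."""
    ordered = math.perm(n, r * k)  # falling factorial (n)_{rk}
    return ordered // (math.factorial(r) * math.factorial(k) ** r)
```

For instance, $n = 4$, $k = 2$, $r = 2$ gives the three perfect matchings of $K_4$.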

For such an $S$, let $Q_S$ and $R_S$ be the events $\{G$ contains all members of $S\}$ and $\{S$ is the set of $H$'s of $G\}$. We have $\Pr(Q_S) = p^{r\binom{k}{2}}$ and will show (for any $S$)

\[ \Pr(R_S \mid Q_S) = \exp[-O(r)], \tag{37} \]

whence (using (36))



VT{iH = r) > Y,PiiQs)PT{Rs\Qs) = sfi^)eM-Oir)] 



s 



> 1 exp[-0(r)] = exp[-0(r)]. 



"k^ 



For the proof of (37), fix $S$; let $W$ be the union of the vertex sets of the copies of $H$ in $S$; and for $i = 0, \ldots, k$, let $T(i)$ be the set of $H$'s (in $K_n$) having exactly $i$ vertices outside $W$. We have

$\Pr(R_S \mid Q_S) \ge (1-p)^{|T(0)|} \prod_{i=1}^{k} \left(1 - p^{\binom{i}{2} + (k-i)i}\right)^{|T(i)|}$ (38)

$= \exp[-O(r)].$

Here the first inequality is given by Harris' Inequality [8] (which for our purposes says that for a product probability measure $\mu$ on $\{0,1\}^E$ (with $E$ a finite set) and events $A_i \subseteq \{0,1\}^E$ that are either all increasing or all decreasing, $\mu(\cap A_i) \ge \prod \mu(A_i)$); for the second we can use, say, $|T(i)| < n^i(rk)^{k-i}$ for $0 \le i \le k$. (We omit the easy arithmetic, just noting that all factors but the last (that is, $i = k$) in (38) are actually much larger than $\exp[-O(r)]$.)
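Harris' Inequality can be illustrated on a toy instance. The following is a minimal sketch, not part of the paper's argument: it takes $G(4,p)$, uses the two decreasing events "no triangle on {0,1,2}" and "no triangle on {1,2,3}" (a choice made here purely for illustration), and computes the exact probabilities by enumerating all $2^6$ graphs.

```python
from itertools import combinations, product


def harris_check(p: float):
    """Return (Pr(A), Pr(B), Pr(A and B)) for two decreasing events in G(4, p).

    A = {no triangle on vertices 0,1,2}, B = {no triangle on vertices 1,2,3}.
    """
    edges = list(combinations(range(4), 2))  # the 6 edges of K_4
    tri_a = [(0, 1), (0, 2), (1, 2)]
    tri_b = [(1, 2), (1, 3), (2, 3)]
    pr_a = pr_b = pr_ab = 0.0
    for bits in product((0, 1), repeat=len(edges)):
        present = {e for e, b in zip(edges, bits) if b}
        # Product-measure weight of this particular graph.
        weight = 1.0
        for b in bits:
            weight *= p if b else 1.0 - p
        in_a = not all(e in present for e in tri_a)
        in_b = not all(e in present for e in tri_b)
        pr_a += weight * in_a
        pr_b += weight * in_b
        pr_ab += weight * (in_a and in_b)
    return pr_a, pr_b, pr_ab


if __name__ == "__main__":
    for p in (0.1, 0.5, 0.9):
        a, b, ab = harris_check(p)
        print(p, ab, a * b, ab >= a * b)
```

Here $\Pr(A) = \Pr(B) = 1 - p^3$ and $\Pr(A \cap B) = 1 - 2p^3 + p^5$, so the inequality reduces to $p^5 \ge p^6$. In (38) the same principle is applied to the much larger family of events that the individual copies in the sets $T(i)$ are absent.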



10 Concluding Remarks 

Of course the big question is, what is the true behavior of the probability (1) for general $H$? We continue to use $t$ for $\log(1/p)$, and here confine ourselves
to $\eta = 1$; that is, we're interested in $\Pr(\xi_H > 2\mathbb{E}\xi_H)$. As usual we don't ask for more than the order of magnitude of the exponent.

One can show, mainly following the argument of Section 9, that for any $K \subseteq H$

$\Pr(\xi_H > 2\mathbb{E}\xi_H) > \exp[-O_H(\Psi(K,n,p))]$ (39)

(where, recall, $\Psi(K,n,p) = n^{v_K}p^{e_K}$). As far as we can see, it could be that the truth in (1) is always given by the largest of the lower bounds in (39) and (4). For the latter we (finally) define

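$M_H(n,p) = \begin{cases} n^2p^{\Delta_H} & \text{if } p \ge n^{-1/\Delta_H}, \\ \min_{K\subseteq H}\left(\Psi(K,n,p)\right)^{1/\alpha^*_K} & \text{if } n^{-1/m_H} \le p \le n^{-1/\Delta_H} \end{cases}$ (40)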

(where, as usual, a* is fractional independence number; see e.g. or [3]). 
This is not quite the same as the quantity $M^*_H(n,p)$ used in [11], but, as
shown in their Theorem 1.5, the two agree up to a constant factor; so the 
difference is irrelevant here. 

Conjecture 10.1. For any $H$ and $p > n^{-1/m_H}$,

$\Pr(\xi_H > 2\mathbb{E}\xi_H) = \exp[-\Theta_H(\min\{\min_K \Psi(K,n,p),\, M_H(n,p)t\})].$ (41)

We remark without proof (it is not quite obvious as far as we know) that, for a given $H$, the set of $p$ for which the (outer) minimum in (41) is $M_H(n,p)t$ is the interval $[p_K, 1]$, where $K$ is a smallest subgraph of $H$ with $m_K = m_H$ and $p_K$ is the unique $p$ for which $\Psi(K,n,p) = M_H(n,p)\log(1/p)$.

Conjecture 10.1 gives a different perspective on the observation from [11, Section 8.1] that $H = K_2$ shows that the lower bound in (4) is not always tight. In this case $M_H(n,p) = n^2p$ for the full range of $p$ above and, of course, $\xi_H$ is just $\mathrm{Bin}(\binom{n}{2},p)$; so the upper bound in (4) is the truth. But in fact (39) shows (with a little thought) that the lower bound in (4) is not tight for any $H$ and sufficiently small $p$ ($> n^{-1/m_H}$), since for small enough $p$ one of the terms $\Psi(K,n,p)$ in (41) is $o(M_H(n,p)t)$. What's special about $K_2$ is that it is the only (connected) $H$ for which the best lower bound is never given by (4); that is, the minimum in (41) is never $M_H(n,p)t$.

It also seems interesting to estimate

$\Pr(\xi_H > \gamma\,\mathbb{E}\xi_H)$ (42)

when $\gamma = \gamma(n) = \omega(1)$. The present results essentially do this for $H = K_k$ and "generic" $p$; precisely, Theorem 2.3(b) implies (using a mild variant of Proposition 2.1)

$\Pr(\xi_H > 2\gamma\Psi(H,n,p)) < \exp[-\Omega(f(k, n\gamma^{1/k}, p))],$ (43)

which, for $p$ in the range where $f(k, n\gamma^{1/k}, p) = n^2\gamma^{2/k}p^{k-1}t$, is (up to the constant in the exponent) the probability of containing a clique of size $np^{(k-1)/2}(2\gamma)^{1/k}$ (provided this is not more than $n$). Of course the trick that gets Theorem 2.3(b) from Theorem 2.3(a) is general, so results on Conjecture 10.1 give corresponding upper bounds for (42); but these bounds will not be tight in general, and at this writing we don't have a good guess as to the general truth in (42).
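The clique comparison behind (43) amounts to the following arithmetic, given here only as a rough sketch. It assumes $t = \log(1/p)$, writes $m$ (a name introduced here) for the clique size $np^{(k-1)/2}(2\gamma)^{1/k}$, assumes $m \le n$, and ignores constants depending on $k$.

```latex
\[
m = n\,p^{(k-1)/2}\,(2\gamma)^{1/k}
\;\Longrightarrow\;
m^{k} = 2\gamma\, n^{k} p^{k(k-1)/2} = 2\gamma\,\Psi(K_k,n,p),
\]
\[
p^{\binom{m}{2}} = \exp\!\Big[-\tbinom{m}{2}\,t\Big]
= \exp\!\big[-\Theta(m^{2}t)\big]
= \exp\!\big[-\Theta_k\big(n^{2}\gamma^{2/k}p^{k-1}t\big)\big].
\]
```

So a clique on $m$ vertices supplies on the order of $\gamma\,\Psi(K_k,n,p)$ copies of $K_k$, and the probability that a fixed $m$-set spans a clique has the same exponent, up to constants, as the right side of (43) in the stated range.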

A Proof of Proposition 2.2

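We may choose $G^* = G(kn,p)$ by first choosing $G = G(n,p,H)$ and then letting

$E(G^*) = E(G) \cup S$

where $\Pr(xy \in S) = p$ whenever $x \ne y$, $x \in V_i$ and $y \in V_j$ for some $v_iv_j \notin E(H)$, these choices made independently. Write $\xi$ and $X$ for the numbers of copies of $H$ in $G^*$ and $G$ respectively (thus $\xi = \xi_H^{kn,p}$ and $X = X_H^{n,p}$), and set $\xi^* = \xi - X$. Since $\mathbb{E}X = \alpha\mathbb{E}\xi$, we have, using Harris' Inequality,

$\Pr\big(\xi > (1+\tfrac{\alpha\varepsilon}{2})\mathbb{E}\xi\big) \ge \Pr\big(X > (1+\varepsilon)\mathbb{E}X\big)\,\Pr\big(\xi^* > \mathbb{E}\xi^* - \tfrac{\alpha\varepsilon}{2}\mathbb{E}\xi\big);$ (44)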

so we need to say that the second probability on the right is at least 1/2. 
This is standard, but we summarize the argument for completeness. 
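A result of Janson [9, (1.1)] gives

$\Pr(\xi^* < \mathbb{E}\xi^* - t) < \exp\left[-\frac{t^2}{2\Delta}\right],$ (45)

with

$\Delta = {\sum_{\sigma}}^{*}\,{\sum_{\tau\sim\sigma}}^{*}\, \mathbb{E}I_\sigma I_\tau \le \sum_{\sigma}\sum_{\tau\sim\sigma} \mathbb{E}I_\sigma I_\tau,$ (46)

where (recycling notation a little) $H_1, \ldots$ are the copies of $H$ in $K_{kn}$; $I_\sigma = \mathbf{1}_{\{H_\sigma \subseteq G^*\}}$; "$\sigma \sim \tau$" means $H_\sigma$ and $H_\tau$ share an edge (so $\sigma \sim \sigma$); and $\sum^*$ means we sum only over $\sigma, \tau$ for which $H_\sigma$, $H_\tau$ cannot appear in $G$.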

But (very wastefully),

$\Delta < n^{v_H}\sum\{n^{v_H - v_K}p^{2e_H - e_K} : K \subseteq H,\ e_K > 0\}$

$< n^{2v_H}p^{2e_H}\sum\{n^{-v_K}(Cn^{-1/m_H})^{-e_K} : K \subseteq H,\ e_K > 0\}$

$= O(C^{-1}\mathbb{E}^2\xi),$

where $C$ is the constant from (5), which may be taken large compared to the implied constant in "$O(\cdot)$." Thus, using (45) with the above bound on $\Delta$ and $t = (\alpha\varepsilon/2)\mathbb{E}\xi$, we find that the second probability on the right side of (44) is at least $1 - \exp[-\Omega((\alpha\varepsilon)^2C)] > 1/2$.
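For completeness, the final substitution can be written out as follows. This is a sketch under two assumptions: that (45) has the standard Janson form $\exp[-t^2/(2\Delta)]$, and that $c$ (a name introduced here) denotes the implied constant in the bound $\Delta = O(C^{-1}\mathbb{E}^2\xi)$.

```latex
\[
\Delta \le c\,C^{-1}(\mathbb{E}\xi)^{2},
\qquad
t = \tfrac{\alpha\varepsilon}{2}\,\mathbb{E}\xi
\;\Longrightarrow\;
\frac{t^{2}}{2\Delta}
\;\ge\;
\frac{(\alpha\varepsilon)^{2}(\mathbb{E}\xi)^{2}/4}{2c\,C^{-1}(\mathbb{E}\xi)^{2}}
= \frac{(\alpha\varepsilon)^{2}\,C}{8c}.
\]
```

Taking $C$ large compared to $c/(\alpha\varepsilon)^2$ makes $\exp[-t^2/(2\Delta)] < 1/2$, which is the bound needed for the second factor in (44).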



B Proof of Lemma 2.5

By Lemma 2.4 we may assume $\Delta := \Delta_H = k-1$ (and will write $\Delta$ in place of $k-1$ in this section). By Proposition 2.2 it's enough to show

Pr(?^^ > (1 + ^mCif)) < 

which, in view of (4) and the definition of $M_H(n,p)$, will follow if we show that, for any $K \subseteq H$, $n^{v_K}p^{e_K} = \Omega((n^2p^{\Delta}t)^{\alpha^*_K})$, or, more conveniently,

$n^{v_K - 2\alpha^*_K}\,p^{e_K - \Delta\alpha^*_K} = \Omega(t^{\alpha^*_K}).$ (47)

We need one easy observation from [11] (see their Lemma 6.2):

$e_K \le \Delta(v_K - \alpha^*_K).$

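Then, noting that

$e_K - \Delta\alpha^*_K < 0$ (48)

(since $e_K < \Delta v_K/2 \le \Delta\alpha^*_K$) and using our upper bound on $p$, we find that the left side of (47) is at least

$n^{v_K - 2\alpha^*_K - (1+\gamma)(e_K - \Delta\alpha^*_K)/\Delta} \ge n^{v_K - 2\alpha^*_K - (v_K - 2\alpha^*_K) + \gamma(\Delta\alpha^*_K - e_K)/\Delta},$

which (again using (48)) gives (47).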